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APOPTOSIS INHIBITORY PROTEIN, GENE ENCODING THE PROTEIN 

AND cDNA THEREOF 

Field of the Invention 

5 The present invention relates to a human apoptosis inhibitory protein, and a 

gene encoding the protein and the cDNA thereof. More specifically, the present 
invention relates to the genetic materials which are useful for the elucidation of the 
onset mechanism of various apoptic diseases such as human spinal muscular atropy, 
the diagnosis of the risk of the onset thereof, and the prevention of the onset thereof. 
10 In addition, the materials are useful for the development of clinical techniques and 
pharmaceutical agents for the amelioration and therapeutic treatment of the diseases. 

Prior Art 

Apoptosis is a programmed cellular death, involving observed phenomena 
15 such as the loss of cellular contact with surrounding cells, cytoplasmic condensation, 
chromatin condensation and nuclear condensation with relation to endonuclease 
activity, nuclear fragmentation, membrane-enveloped spherical microbodies, the 
phagocytosis of spherical microbodies with adjacent macrophages or epithelial cells, or 
the fragmentation of the DNA nucleosome unit into DNAs of 180 to 200 bp due to 
20 endonuclease activity. It is suggested that apoptosis is a phagocytic mechanism for 
the final fragment of an apoptic somatic cell under such observed phenomena by 
adjacent cells (see for example Immunology Today 7: 115-119, 1986; Science 
245:301-305, 1989). 

As an apoptosis inhibitory gene, for example, gene bcl-2 has been known. 
25 The gene dc/-2, one of oncogenes discovered in 1985 in alveolar B cytoma, is highly 
expressed in the immune system and nervous system, and it is believed that the 
expression product of the gene serves to maintain the homeostasis of the human 
immune functions and neuronal functions, by inhibiting the apoptosis of the cells 
involved. Additionally because the bcl-2 is expressed in a diversified range in fetuses 
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in particular, the gene is believed to play a significant role in morphological formation 
during ontogenesis. 

Meanwhile, the present inventors have isolated the gene of a neuronal 
apoptosis inhibitory protein (NAIP) from the human chromosome 5q13.1 region as an 
5 etiological gene of a familial hereditary disease spinal muscular atropy (SMA) (Roy et 
al., Cell 80: 167-178, 1995), and have filed a patent application (PCT/CA95/00581). 
More specifically, it is supposed that the mutation of the NAIP gene or the decrease of 
the copy number thereof might cause the apoptosis of spinal neuron, which is an 
etiology of the SMA onset. It is apparently demonstrated that by introducing the NAIP 
10 gene into various cultured cells to give apoptosis-inducing stimulation to the cells, the 
death of the cells is inhibited (Listen et al., Nature 379: 349-353, 1996), which indicates 
that NAIP plays a role of an apoptosis inhibitory factor for not only neuronal cells but 
also overall somatic cells. 

15 Summary of The Invention 

The present inventors have further promoted the analysis of the NAIP gene, 
and they have successfully achieved to clone the full length of cDNA of NAIP gene and 
to identify the protein encoded in the cDNA. 

It is an object of the present invention to provide the cDNA of NAIP gene thus 
20 found by the present inventors, genetic materials with relation to the cDNA and the 
expression products thereof and the like in industrially applicable forms. 

An invention provided by the present application is a human apoptosis 
inhibitory protein which comprises the amino acid sequence of SQ ID No:1 , or an amino 
acid sequence with deletion, substitution or addition of a single or plural amino acids in 
25 SQIDNo:1. 

Another invention is a human apoptosis inhibitory protein comprising the 
amino acid sequence of SQ ID No:3, or an amino acid sequence with deletion, 
substitution or addition of a single or plural amino acids in SQ ID No:3. 

Other inventions are a human gene encoding the human apoptosis inhibitory 
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proteins, cDNAs of said human gene which comprises at least the nucleotide sequence 
for the coding region of SQ ID No:2 or NO:4. 

Still additionally, inventions of this application are an antibody against the 
human apoptosis inhibitory proteins, a non-human animal gene to which the above 
5 cDNAs are hybridized, recombinant vector carrying the cDNAs or a partial sequence 
thereof, a DNA probe comprising a partial sequence of the cDNAs, and a set of PGR 
primer corresponding to partial sequences of the cDNAs. 

The present inventions will now be described below in more detail with 
reference to embodiments. 

10 

Brief Description of Drawing 
Fig.1 schematically depicts the individual 3'-terminal structures of the 
conventionally known apoptosis inhibitory gene NAIPs and the inventive genes NAIPm 
and NAIPl. 

15 

Detailed Description of The Invention 
The human apoptosis inhibitory protein of the present invention is a human 
protein comprising the amino acid sequence of SQ ID No.1 or 3. A peptide (with 5 
amino acid residues or more) consisting of any partial amino acid sequence of the 
20 amino acid sequence of SQ ID No.1 or 3 is included in the scope of this protein. Such 
peptide may be used as an antigen to prepare an antibody, for example. Furthermore, 
the protein of the present invention includes fusion proteins with other proteins (for 
example, fluorescent proteins). 

According to known methods, the protein of the present invention may be 
25 isolated from human organs or cell lines. When intending to use the protein as a 
peptide, the protein may be prepared on the basis of the amino acid sequences 
provided by the present invention by chemical synthesis. Otherwise, the protein may 
be obtained through in vitro transcription or a recombinant DNA technique by using a 
cDNA fragment provided by the present invention. In order to obtain the protein by 
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the recombinant DNA technique, for example, the protein of the present invention may 
be expressed at a large scale from a host cell {Escherichia coli, Bacillus subtilis, yeast, 
animal or plant cells, etc.) which has been transformed by a recombinant vector 
prepared by inserting the cDNA fragment of the present invention in an appropriate 
5 expression vector. For expressing the protein in a microorganism such as Escherichia 
coh\ more specifically, the cDNA of the present invention is inserted within an 
expression vector having an origin suitable for the microorganism, a promoter 
sequence, a ribosome-binding site, DNA cloning sites, a terminator sequence and the 
like to prepare an expression vector, which is used to transform a host cell and 

10 thereafter culture the resulting transformant, whereby a protein encoded by the cDNA 
can be produced in the microorganism at a large scale. Otherwise, the protein may be 
expressed in the form of a fused protein with other proteins. By hydrolyzing the 
resulting fused protein with an appropriate protease, a protein part encoded by the 
cDNA may be recovered. For intending to allow the protein of the present invention to 

15 be expressed and secreted in an animal cell, alternatively, the cDNA fragment is 
inserted within an animal cell expression vector with an animal cell promoter, a splicing 
region, a poly(A) additional site, and the like, the protein of the present invention may 
be expressed in the animal cell. 

The gene of the present invention is derived from humans and other 

20 mammals and encodes the protein, and can be isolated from the known genomic 
libraries by using the cDNA of the present invention or a partial sequence thereof as 
the probe. 

The cDNA of the present invention comprises the nucleotide sequence of 
SQ ID No.2 or 4. The cDNAs of the nucleotide sequences of SQ ID Nos.2 and 4 
25 encode the proteins of the amino acid sequences of SQ ID Nos. 1 and 3, respectively. 

Because the protein of the present invention is expressed in any human 
tissue, a clone identical to the cDNA of the present invention may readily be recovered 
by screening human cDNA libraries by using an oligonucleotide probe synthesized on 
the basis of the nucleotide sequence of the cDNA of SQ ID No.2 or 4. Otherwise, the 
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objective cDNA may be synthesized by polymerase chain reaction (PGR) by using such 
oligonucleotides as primers. Generally, it is frequently observed that human genes 
have polymorphism due to differences of individual nucleotide. Thus, cDNAs in which 
the addition and deletion of a single or plural nucleotides and/or the substitution with a 
5 single or plural nucleotides occur in SQ ID No.2 or 4 are also encompassed within the 
scope of the present invention. Similarly, proteins in which the addition and deletion 
of a single or plural amino acid residues and/or the substitution with a single or plural 
amino acid residues occur due to such modification are also encompassed within the 
scope of the present invention, as long as the proteins have the activities of the protein 

10 with the amino acid sequence of SQ ID No,1 or 3. 

Additionally, the partial sequence of the cDNA of the present invention is a 
continuous sequence of 10 bp or more in the nucleotide sequence of SQ ID No.2 or 4, 
and DNA fragments (sense chain and antisense chain) comprising such continuous 
sequence are also encompassed within the scope of the present invention. These 

15 DNA fragments may be used as probes for genetic diagnosis, for example. 

Furthermore, the antibody of the present invention may be prepared in the 
form of a polyclonal antibody or monoclonal antibody, by known methods by using the 
protein described above of itself or a partial peptide thereof as an antigen. 

The present invention will now be described more specifically in more detail 

20 in examples, but the invention is not limited to the following examples. 

Examples 

Example 1: Screening of cDNA library 

Exxon 16 of the NAIP gene was PGR amplified by using the oligonucleotides 
25 of SQ ID Nos.5 and 6 as primers. PGR conditions were as follows; 94 °G for 15 
seconds. 56 °G for 30 seconds and 
72 °C for one minute. 

By using the resulting PGR product, then, the cDNA library of human fetal 
brain (NA 937227; Stratagene) was screened. As a result, eight clones with overlaps 
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with the NAIP gene were identified. 

As a result of the sequence analysis, the eight cDNA clones were separated 
into seven clones having the same coding region at the 3' tennini and one clone 
comprising a shorter DNA fragment than those of the seven clones. Based on the 
5 length of the DNA fragments, furthermore, it was identified that the genes encoding 
these clones were longer DNA molecules than the NAIP gene previously reported. 

For convenience, hereinafter, the conventionally known NAIP gene is 
referred to as NAIPs; the gene encoding the longer cDNA thus screened is referred to 
as NAIPl; and the shorter gene is referred to as NAIPm- 

10 

Example 2: Sequencing of the cDNAs 

The nucleotide sequences of the cDNA clones identified in Example 1 were 
determined. By using the sequences determined by using the oligonucleotides of SQ 
ID Nos.7 and 8 as primary primers, additional primers were sequentially prepared, to 

15 determine the full sequences of the cDNAs by the walking method. 

Consequently, it is confirmed that the conventionally known exons of NAIPs 
(upper column, Fig.1) is inaccurate. NAIPm and NAIPl do not have exon 1 of NAIPs 
and have a new exon (153 bp) between the exons 14 and 15 of the NAIPs (middle and 
lower columns, Fig.1). Additionally, it is confirmed that NAIPl have an additional exon 

20 at the 3' terminus of the NAIPm (lower columns, Fig,1). 

In other words, the NAIP is expressed in two splice variant forms, NAIPm with 
exons 1 to 16 and NAIPl with exons 1 to 17. In more detail, NAIPm has the novel exon 
14 and additionally contains extra 39 bp at the 3' temriinus of the exon 16, while the 
cDNA thereof has the nucleotide sequence of SQ ID No.4 and encodes the protein of 

25 the amino acid sequence of SQ ID No.3. On the other hand, NAIPl contains exon 17 
of 363 bp in addition to the exon 14, while the cDNA thereof has the nucleotide 
sequence of SQ ID No,2 and encodes the protein of the amino acid sequence of SQ ID 
No.1. 

Based on the aforementioned results, it is verified that the apoptosis 
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inhibitory genes NAIPm and NAIPl of the present invention are novel genes, apparently 
different from the conventionally known gene NAIPs; and that the apoptosis inhibitory 
proteins encoded by these genes are novel proteins. 

5 Example 3: Expression of protein in Escherichia coli 

A translated region was PGR amplified by using an NAIPt-containing clone 
isolated in Example 1 as template. The resulting PGR product was inserted into an 
expression vector for Escherichia coli, and after confirming the nucleotide sequence of 
the insert, the host Escherichia coli was transformed with the vector. The 

10 transformant was cultured in an LB culture medium at 37 °C for 5 hours, followed by 
addition of IPTG to a final concentration of 0.4 mM and subsequent additional culturing 
at 37 °G for 2.5 hours. The bacteria were centrifuged and isolated, and were then 
dissolved in a dissolving solution, and the resulting solution was once frozen at -80 °G 
and thawed, for ultrasonic disruption. The solution in disruption was centrifuged, and 

15 from the resulting supernatant was isolated and purified a protein, which was 
recovered as the apoptosis inhibitory protein (SQ ID No.1) of the present invention. 

Example 4: Preparation of antibody 

A rabbit was immunized with the protein obtained in Example 3 as an antigen, 
20 to prepare an anti-serum. From the antiserum was first removed a 40 %-saturated 
ammonium sulfate precipitate fraction on a GST affinity column. The pass-through 
fraction was further purified on an antigen column GST-HP10345. 

As has been described above, the novel apoptosis inhibitory proteins, the 
25 gene encoding the proteins and the cDNAs thereof are provided in accordance with the 
present invention, whereby the elucidation of the onset mechanism of various apoptic 
diseases primarily including human spinal muscular atropy, the diagnosis of the risk of 
the onset thereof, the prevention of the onset thereof and the amelioration of the 
diseased conditions, and the development of clinical techniques and pharmaceutical 
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agents for the therapeutic treatment, can be attained. 

Sequence Listing 
<110> Japan Science and Technology Corporation 

5 

<120> Apoptosis inhibitory Protein, Gene Encoding The Protein 
and cDNA thereof 

<140> 

10 <141> 1999-01-29 

<150> 
<151> 

15 <160> 8 

<170> Patent In Ver. 2.0 

<210> 1 

<211> 1403 

<212> PRT 

20 <213> Homo sapiens 

<400> 1 

Met Ala Thr Gin Gin Lys Ala Ser Asp Glu Arg lie Ser Gin Phe Asp 
15 10 15 

25 His Asn Leu Leu Pro Glu Leu Ser Ala Leu Leu Gly Leu Asp Ala Val 
20 25 30 

Gin Leu Ala Lys Glu Leu Glu Glu Glu Glu Gin Lys Glu Arg Ala Lys 

35 40 45 

Met Gin Lys Gly Tyr Asn Ser Gin Met Arg Ser Glu Ala Lys Arg Leu 
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50 55 60 

Lys Thr Phe Val Thr Tyr Glu Pro Tyr Ser Ser Trp Me Pro Gin Glu 
65 70 75 80 

Met Ala Ala Ala Gly Phe Tyr Phe Thr Gly Val Lys Ser Gly I le Gin 
5 85 90 95 

Cys Phe Cys Cys Ser Leu I le Leu Phe Gly Ala Gly Leu Thr Arg Leu 

100 105 110 

Pro Me Glu Asp His Lys Arg Phe His Pro Asp Cys Gly Phe Leu Leu 
115 120 125 

10 Asn Lys Asp Val Gly Asn Me Ala Lys Tyr Asp I le Arg Val Lys Asn 
130 135 140 

Leu Lys Ser Arg Leu Arg Gly Gly Lys Met Arg Tyr Gin Glu Glu Glu 
145 150 155 160 

Ala Arg Leu Ala Ser Phe Arg Asn Trp Pro Phe Tyr Val Gin Gly I le 
15 165 170 175 

Ser Pro Cys Val Leu Ser Glu Ala Gly Phe Val Phe Thr Gly Lys Gin 

180 185 190 

Asp Thr Val Gin Cys Phe Ser Cys Gly Gly Cys Leu Gly Asn Trp Glu 
1 95 200 205 

20 Glu Gly Asp Asp Pro Trp Lys Glu His Ala Lys Trp Phe Pro Lys Cys 
210 215 220 

Glu Phe Leu Arg Ser Lys Lys Ser Ser Glu Glu Me Thr Gin Tyr Me 
225 230 235 240 

Gin Ser Tyr Lys Gly Phe Val Asp Me Thr Gly Glu His Phe Val Asn 
25 245 250 255 

Ser Trp Val Gin Arg Glu Leu Pro Met Ala Ser Ala Tyr Cys Asn Asp 

260 265 270 

Ser I le Phe Ala Tyr Glu Glu Leu Arg Leu Asp Ser Phe Lys Asp Trp 
275 280 285 
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Pro Arg Glu Ser Ala Val Gly Val Ala Ala Leu Ala Lys Ala Gly Leu 

290 295 300 

Phe Tyr Thr Gly Me Lys Asp Me Val Gin Cys Phe Ser Cys Gly Gly 
305 310 315 320 

5 Cys Leu Glu Lys Trp Gin Glu Gly Asp Asp Pro Leu Asp Asp His Thr 

325 330 335 

Arg Cys Phe Pro Asn Cys Pro Phe Leu Gin Asn Met Lys Ser Ser Ala 

340 345 350 

Glu Val Thr Pro Asp Leu Gin Ser Arg Gly Glu Leu Cys Glu Leu Leu 
10 355 360 365 

Glu Thr Thr Ser Glu Ser Asn Leu Glu Asp Ser Me Ala Val Gly Pro 

370 375 380 

Me Val Pro Gtu Met Ala Gin Gly Glu Ala Gin Trp Phe Gin Glu Ala 
385 390 395 400 

15 Lys Asn Leu Asn Glu Gin Leu Arg Ala Ala Tyr Thr Ser Ala Ser Phe 

405 410 415 

Arg His Met Ser Leu Leu Asp I le Ser Ser Asp Leu Ala Thr Asp His 

420 425 430 

Leu Leu Gly Cys Asp Leu Ser I le Ala Ser Lys His I le Ser Lys Pro 
20 435 440 445 

Val Gin Glu Pro Leu Val Leu Pro Glu Val Phe Gly Asn Leu Asn Ser 

450 455 460 

Val Met Cys Val Glu Gly Glu Ala Gly Ser Gly Lys Thr Val Leu Leu 
465 470 475 480 

25 Lys Lys Me Ala Phe Leu Trp Ala Ser Gly Cys Cys Pro Leu Leu Asn 

485 490 495 

Arg Phe Gin Leu Val Phe Tyr Leu Ser Leu Ser Ser Thr Arg Pro Asp 

500 505 510 

Glu Gly Leu Ala Ser Me Me Cys Asp Gin Leu Leu Glu Lys Glu Gly 
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515 520 525 

Ser Val Thr Glu Met Cys Met Arg Asn i I Me Gin Gin Leu Lys Asn 

530 535 540 

Gin Val Leu Phe Leu Leu Asp Asp Tyr Lys Glu I le Cys Ser I le Pro 
5 545 550 555 560 

Gin Val I le Gly Lys Leu Me Gin Lys Asn His Leu Ser Arg Thr Cys 

565 570 575 

Leu Leu I le Ala Val Arg Thr Asn Arg Ala Arg Asp I le Arg Arg Tyr 
580 585 590 

10 Leu Glu Thr Me Leu Glu I le Lys Ala Phe Pro Phe Tyr Asn Thr Val 
595 600 605 

Cys Me Leu Arg Lys Leu Phe Ser His Asn Met Thr Arg Leu Arg Lys 

610 615 620 

Phe Met Val Tyr Phe Gly Lys Asn Gin Ser Leu Gin Lys Me Gin Lys 
15 625 630 635 640 

Thr Pro Leu Phe Val Ala Ala Me Cys Ala His Trp Phe Gin Tyr Pro 

645 650 655 

Phe Asp Pro Ser Phe Asp Asp Val Ala Val Phe Lys Ser Tyr Met Glu 
660 665 670 

20 Arg Leu Ser Leu Arg Asn Lys Ala Thr Ala Glu Me Leu Lys Ala Thr 
675 680 685 

Val Ser Ser Cys Gly Glu Leu Ala Leu Lys Gly Phe Phe Ser Cys Cys 

690 695 700 

Phe Glu Phe Asn Asp Asp Asp Leu Ala Glu Ala Gly Val Asp Glu Asp 
25 705 710 715 720 

Glu Asp Leu Thr Met Cys Leu Met Ser Lys Phe Thr Ala Gin Arg Leu 

725 730 735 

Arg Pro Phe Tyr Arg Phe Leu Ser Pro Ala Phe Gin Glu Phe Leu Ala 
740 745 750 
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Gly Met Arg Leu Me Glu Leu Leu Asp Ser Asp Arg Gin Glu His Gin 

755 760 765 

Asp Leu Gly Leu Tyr His Leu Lys Gin lie Asn Ser Pro Met Met Thr 
770 775 780 

5 Va I Ser Ala Tyr Asn Asn Phe Leu Asn Tyr Val Ser Ser Leu Pro Ser 
785 790 795 800 

Thr Lys Ala Gly Pro Lys Me Val Ser His Leu Leu His Leu Val Asp 

805 810 815 

Asn Lys Glu Ser Leu Glu Asn Me Ser Glu Asn Asp Asp Tyr Leu Lys 
10 820 825 830 

His Gin Pro Glu I le Ser Leu Gin Met Gin Leu Leu Arg Gly Leu Trp 

835 840 845 

Gin Me Cys Pro Gin Ala Tyr Phe Ser Met Val Ser Glu His Leu Leu 
850 855 860 

15 Val Leu Ala Leu Lys Thr Ala Tyr Gin Ser Asn Thr Val Ala Ala Cys 
865 870 875 880 

Ser Pro Phe Val Leu Gin Phe Leu Gin Gly Arg Thr Leu Thr Leu Gly 

885 890 895 

Ala Leu Asn Leu Gin Tyr Phe Phe Asp His Pro Glu Ser Leu Ser Leu 
20 900 905 910 

Leu Arg Ser Me His Phe Pro Me Arg Gly Asn Lys Thr Ser Pro Arg 

915 920 925 

Ala His Phe Ser Val Leu Glu Thr Cys Phe Asp Lys Ser Gin Val Pro 
930 935 940 

25 Thr Me Asp Gin Asp Tyr Ala Ser Ala Phe Glu Pro Met Asn Glu Trp 
945 950 955 960 

Glu Arg Asn Leu Ala Glu Lys Glu Asp Asn Val Lys Ser Tyr Met Asp 

965 970 975 

Met Gin Arg Arg Ala Ser Pro Asp Leu Ser Thr Gly Tyr Trp Lys Leu 
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980 985 990 

Ser Pro Lys Gin Tyr Lys Me Pro Cys Leu Glu Val Asp Va I Asn Asp 

995 1000 1005 

Me Asp Val Val Gly Gin Asp Met Leu Glu Me Leu Met Thr Val Phe 
5 1010 1015 1020 

Ser Ala Ser Gin Arg Me Glu Leu His Leu Asn His Ser Arg Gly Phe 
025 1030 1035 1040 

Me Glu Ser Me Arg Pro Ala Leu Glu Leu Ser Lys Ala Ser Val Thr 
1045 1050 1055 

10 Lys Cys Ser Me Ser Lys Leu Glu Leu Ser Ala Ala Glu Gin Glu Leu 
1060 1065 1070 

Leu Leu Thr Leu Pro Ser Leu Glu Ser Leu Glu Val Ser Gly Thr Me 

1075 1080 1085 

Gin Ser Gin Asp Gin Me Phe Pro Asn Leu Asp Lys Phe Leu Cys Leu 
15 1090 1095 1100 

Lys Glu Leu Ser Val Asp Leu Glu Gly Asn Me Asn Val Phe Ser Val 
105 1110 1115 1120 

Me Pro Glu Glu Phe Pro Asn Phe His His Met Glu Lys Leu Leu Me 
1125 1130 1135 

20 Gin Me Ser Ala Glu Tyr Asp Pro Ser Lys Leu Val Lys Leu Me Gin 
1140 1145 1150 

Asn Ser Pro Asn Leu His Val Phe His Leu Lys Cys Asn Phe Phe Ser 

1155 1160 1165 

Asp Phe Gly Ser Leu Met Thr Met Leu Val Ser Cys Lys Lys Leu Thr 
25 1170 1175 1180 

Glu Me Lys Phe Ser Asp Ser Phe Phe Gin Ala Val Pro Phe Val Ala 
185 1190 1195 1200 

Ser Leu Pro Asn Phe I le Ser Leu Lys I le Leu Asn Leu Glu Gly Gin 
1205 1210 1215 
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Gin Phe Pro Asp Glu Glu Thr Ser Glu Lys Phe Ala Tyr lie Leu Gly 

1220 1225 1230 

Ser Leu Ser Asn Leu Glu Glu Leu I le Leu Pro Thr Gly Asp Gly Me 
1235 1240 1245 

5 Tyr Arg Val Ala Lys Leu lie lie Gin Gin Cys Gin Gin Leu His Cys 
1250 1255 1260 

Leu Arg Val Leu Ser Phe Phe Lys Thr Leu Asn Asp Asp Ser Val Val 
265 1270 1275 1280 

Glu Me Ala Lys Val Ala Me Ser Gly Gly Phe Gin Lys Leu Glu Asn 
10 1285 1290 1295 

Leu Lys Leu Ser Me Asn His Lys i le Thr Glu Glu Gly Tyr Arg Asn 

1300 1305 1310 

Phe Phe Gin Ala Leu Asp Asn Met Pro Asn Leu Gin Glu Leu Asp I le 
1315 1320 1325 

15 Ser Arg His Phe Thr Glu Cys Me Lys Ala Gin Ala Thr Thr Val Lys 
1330 1335 1340 

Ser Leu Ser Gin Cys Val Leu Arg Leu Pro Arg Leu I le Arg Leu Asn 
345 1350 1355 1360 

Met Leu Ser Trp Leu Leu Asp Ala Asp Asp i le Ala Leu Leu Asn Val 
20 1365 1370 1375 

Met Lys Glu Arg His Pro Gin Ser Lys Tyr Leu Thr I le Leu Gin Lys 

1380 1385 1390 

Trp I le Leu Pro Phe Ser Pro Me Me Gin Lys 
1395 1400 

25 

<210> 2 
<211> 5984 
<212> DNA 

<213> Homo sapiens 
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<220> 
<221> CDS 

<222> (292) . . (4500) 

5 

<4GG> 2 

acaaaaggtc ctgtgctcac ctgggaccct tctggacgtt gccctgtgtt cctcttcgcc 60 
tgcctgttca tctacgacga accccgggta ttgaccccag acaacaatgc cacttcatat 120 
tggggacttc gtctgggatt ccaaggtgca ttcattgcaa agttccttaa atattttctc 180 
10 actgcttcct actaaaggac ggacagagca tttgttcttc agccacatac tttccttcca 240 
ctggccagca ttctcctcta ttagactaga actgtggata aacctcagaa a atg gcc 297 

Met Ala 
1 

acc cag cag aaa gcc tct gac gag agg ate tec cag ttt gat cac aat 345 
15 Thr Gin Gin Lys Ala Ser Asp Glu Arg I le Ser Gin Phe Asp His Asn 
5 10 15 

ttg ctg oca gag ctg tct get ctt ctg ggc eta gat gea gtt eag ttg 393 
Leu Leu Pro Glu Leu Ser Ala Leu Leu Gly Leu Asp Ala Va I Gin Leu 
20 25 30 

20 gca aag gaa eta gaa gaa gag gag cag aag gag cga gea aaa atg cag 441 
Ala Lys Glu Leu Glu Glu Glu Glu Gin Lys Glu Arg Ala Lys Met Gin 
35 40 45 50 

aaa gge tac aac tet eaa atg ego agt gaa gca aaa agg tta aag act 489 
Lys Gly Tyr Asn Ser Gin Met Arg Ser Glu Ala Lys Arg Leu Lys Thr 
25 55 60 65 

ttt gtg act tat gag ceg tac age tea tgg ata eca eag gag atg gcg 537 
Phe Val Thr Tyr Glu Pro Tyr Ser Ser Trp I le Pro Gin Glu Met Ala 

70 75 80 

gee get ggg ttt tac ttc act ggg gta aaa tct ggg att cag tge ttc 585 
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Ala Ala Gly Phe Tyr Phe Thr Gly Va I Lys Ser Gly Me Gin Cys Phe 

85 90 95 

tgc tgt age eta ate cte ttt ggt gee ggc etc acg aga etc eee ata 633 
Cys Cys Ser Leu I le Leu Phe Gly Ala Gly Leu Thr Arg Leu Pro I le 
5 100 105 110 

gaa gac cac aag agg ttt cat cca gat tgt ggg ttc ctt ttg aac aag 681 
Glu Asp His Lys Arg Phe His Pro Asp Cys Gly Phe Leu Leu Asn Lys 
115 120 125 130 

gat gtt ggt aac att gee aag tac gac ata agg gtg aag aat etg aag 729 
10 Asp Val Gly Asn I le Ala Lys Tyr Asp I le Arg Val Lys Asn Leu Lys 

135 140 145 

age agg etg aga gga ggt aaa atg agg tac caa gaa gag gag get aga 777 
Ser Arg Leu Arg Gly Gly Lys Met Arg Tyr Gin Glu Glu Glu Ala Arg 
1 50 155 1 60 

15 ctt gca tec ttc agg aac tgg cca ttt tat gtc caa ggg ata tec cct 825 
Leu Ala Ser Phe Arg Asn Trp Pro Phe Tyr Val Gin Gly Me Ser Pro 

165 170 175 

tgt gtg etc tea gag get ggc ttt gtc ttt aca ggt aaa cag gac acg 873 
Cys Val Leu Ser Glu Ala Gly Phe Val Phe Thr Gly Lys Gin Asp Thr 
20 180 185 190 

gta cag tgt ttt tec tgt ggt gga tgt tta gga aat tgg gaa gaa gga 921 
Val Gin Cys Phe Ser Cys Gly Gly Cys Leu Gly Asn Trp Glu Glu Gly 
195 200 205 210 

gat gat cct tgg aag gaa cat gee aaa tgg ttc eee aaa tgt gaa ttt 969 
25 Asp Asp Pro Trp Lys Glu His Ala Lys Trp Phe Pro Lys Cys Glu Phe 

215 220 225 

ctt egg agt aag aaa tec tea gag gaa att ace cag tat att caa age 1017 
Leu Arg Ser Lys Lys Ser Ser Glu Glu Me Thr Gin Tyr Me Gin Ser 
230 235 240 
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tac aag gga ttt gtt gac at a acg gga gaa cat ttt gtg aat tec tgg 1065 
Tyr Lys Gly Phe Val Asp Me Thr Gly Glu His Phe Val Asn Ser Trp 

245 250 255 

gtc cag aga gaa tta act atg gca tea get tat tgc aat gac age ate 1113 
5 Val Gin Arg Glu Leu Pro Met Ala Ser Ala Tyr Cys Asn Asp Ser Me 
260 265 270 

ttt get tac gaa gaa eta egg etg gac tet ttt aag gac tgg cec egg 1161 
Phe Ala Tyr Glu Glu Leu Arg Leu Asp Ser Phe Lys Asp Trp Pro Arg 
275 280 285 290 

10 gaa tea get gtg gga gtt gca gca etg gee aaa gca ggt ctt tte tac 1209 
Glu Ser Ala Val Gly Val Ala Ala Leu Ala Lys Ala Gly Leu Phe Tyr 

295 300 305 

aca ggt ata aag gac ate gtc cag tgc ttt tec tgt gga ggg tgt tta 1257 
Thr Gly I le Lys Asp I le Val Gin Cys Phe Ser Cys Gly Gly Cys Leu 
15 310 315 320 

gag aaa tgg cag gaa ggt gat gac eca tta gac gat cac ace aga tgt 1305 
Glu Lys Trp Gin Glu Gly Asp Asp Pro. Leu Asp Asp His Thr Arg Cys 

325 330 335 

ttt cec aat tgt eca ttt etc caa aat atg aag tec tet geg gaa gtg 1353 
20 Phe Pro Asn Cys Pro Phe Leu Gin Asn Met Lys Ser Ser Ala Glu Val 
340 345 350 

act eca gac ctt cag age cgt ggt gaa ctt tgt gaa tta etg gaa ace 1401 
Thr Pro Asp Leu Gin Ser Arg Gly Glu Leu Cys Glu Leu Leu Glu Thr 
355 360 365 370 

25 aca agt gaa age aat ctt gaa gat tea ata gca gtt ggt ect ata gtg 1449 
Thr Ser Glu Ser Asn Leu Glu Asp Ser lie Ala Val Gly Pro Me Val 

375 380 385 

eca gaa atg gca cag ggt gaa gcc cag tgg ttt caa gag gca aag aat 1497 
Pro Glu Met Ala Gin Gly Glu Ala Gin Trp Phe Gin Glu Ala Lys Asn 
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390 395 400 

ctg aat gag cag ctg aga gca get tat acc age gee agt ttc ege cac 
Leu Asn Glu Gin Leu Arg Ala Ala Tyr Thr Ser Ala Ser Phe Arg His 
405 410 415 

5 atg tet ttg ett gat ate tet tee gat etg gee aeg gae eae ttg etg 
Met Ser Leu Leu Asp Me Ser Ser Asp Leu Ala Thr Asp His Leu Leu 

420 425 430 

gge tgt gat etg tet att get tea aaa cac ate ago aaa cct gtg caa 
Gly Cys Asp Leu Ser I le Ala Ser Lys His I le Ser Lys Pro Val Gin 
10 435 440 445 450 

gaa eet etg gtg etg eet gag gte ttt gge aae ttg aae tet gte atg 
Glu Pro Leu Val Leu Pro Glu Val Phe Gly Asn Leu Asn Ser Val Met 

455 460 465 

tgt gtg gag ggt gaa get gga agt gga aag acg gte etc ctg aag aaa 
15 Cys Val Glu Gly Glu Ala Gly Ser Gly Lys Thr Val Leu Leu Lys Lys 
470 475 480 

ata get ttt etg tgg gea tet gga tge tgt eee etg tta aae agg tte 
Me Ala Phe Leu Trp Ala Ser Gly Cys Cys Pro Leu Leu Asn Arg Phe 
485 490 495 

20 cag ctg gtt tte tae ete tee ett agt tec aee aga eea gae gag ggg 
Gin Leu Val Phe Tyr Leu Ser Leu Ser Ser Thr Arg Pro Asp Glu Gly 

500 505 510 

etg gee agt ate ate tgt gae eag ete eta gag aaa gaa gga tet gtt 
Leu Ala Ser Me Me Cys Asp Gin Leu Leu Glu Lys Glu Gly Ser Val 
25 515 520 525 530 

act gaa atg tge atg agg aae att ate eag eag tta aag aat eag gte 
Thr Glu Met Cys Met Arg Asn Me Me Gin Gin Leu Lys Asn Gin Val 

535 540 545 

tta tte ett tta gat gae tae aaa gaa ata tgt tea ate eet eaa gte 



1545 



1593 



1641 



1689 



1737 



1785 



1833 



1881 



1929 



1977 
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Leu Phe Leu 

ata gga aaa 
Me Gly Lys 
5 565 
att get gtc 
1 le Ala Val 

580 
acc att eta 
10 Thr Me Leu 
595 

tta egg aag 
Leu Arg Lys 

15 gtt tac ttt 
Val Tyr Phe 

etc ttt gtg 
Leu Phe Val 

20 645 
eea tec ttt 
Pro Ser Phe 

660 
tec tta agg 

25 Ser Leu Arg 
675 

tee tgt ggt 
Ser Cys Gly 



Leu Asp Asp Tyr Lys 
550 

etg att caa aaa aac 
Leu lie Gin Lys Asn 
570 

egt aca aae agg gee 
Arg Thr Asn Arg Ala 
585 

gag ate aaa gca ttt 
Glu Me Lys Ala Phe 
600 

etc ttt tea cat aat 
Leu Phe Ser His Asn 
615 

gga aag aac caa agt 
Gly Lys Asn Gin Ser 
630 

gcg gcg ate tgt get 
Ala Ala I le Cys Ala 
650 

gat gat gtg get gtt 
Asp Asp Val Ala Val 
665 

aae aaa gcg aca get 
Asn Lys Ala Thr Ala 
680 

gag etg gee ttg aaa 
Glu Leu Ala Leu Lys 
695 



Glu Me Cys Ser Me 
555 

cac tta tec egg acc 
His Leu Ser Arg Thr 
575 

agg gac ate egc cga 
Arg Asp I le Arg Arg 
590 

ecc ttt tat aat act 
Pro Phe Tyr Asn Thr 
605 

atg act egt etg cga 
Met Thr Arg Leu Arg 
620 

ttg cag aag ata cag 
Leu Gin Lys Me Gin 
635 

cat tgg ttt cag tat 
His Trp Phe Gin Tyr 
655 

ttc aag tee tat atg 
Phe Lys Ser Tyr Met 
670 

gaa att etc aaa gca 
Glu Me Leu Lys Ala 
685 

ggg ttt ttt tea tgt 
Gly Phe Phe Ser Cys 
700 



Pro Gin Val 
560 

tge eta ttg 2025 
Cys Leu Leu 

tac eta gag 2073 
Tyr Leu Glu 

gtc tgt ata 2121 
Val Cys I le 
610 

aag ttt atg 2169 
Lys Phe Met 

625 

aaa act eet 2217 
Lys Thr Pro 
640 

eet ttt gae 2265 
Pro Phe Asp 

gaa egc ctt 2313 
Glu Arg Leu 

act gtg tec 2361 
Thr Val Ser 
690 

tge ttt gag 2409 
Cys Phe Glu 
705 
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ttt aat gat gat gat etc gca gaa gca ggg gtt gat gaa gat gaa gat 2457 
Phe Asn Asp Asp Asp Leu Ala Glu Ala Gly Val Asp Glu Asp Glu Asp 

710 715 720 

eta ace atg tgc ttg atg age aaa ttt aea gee cag aga eta aga eca 2505 
5 Leu Thr Met Cys Leu Met Ser Lys Phe Thr Ala Gin Arg Leu Arg Pro 
725 730 735 

ttc tac egg ttt tta agt cet gee tte eaa gaa ttt ett gcg ggg atg 2553 
Phe Tyr Arg Phe Leu Ser Pro Ala Phe Gin Glu Phe Leu Ala Gly Met 
740 745 750 

10 agg ctg att gaa etc ctg gat tea gat agg cag gaa cat caa gat ttg 2601 
Arg Leu I le Glu Leu Leu Asp Ser Asp Arg Gin Glu His Gin Asp Leu 
755 760 765 770 

gga ctg tat cat ttg aaa eaa ate aac tea cee atg atg act gta age 2649 
Gly Leu Tyr His Leu Lys Gin Me Asn Ser Pro Met Met Thr Val Ser 
15 775 780 785 

gee tac aac aat ttt ttg aac tat gtc tec age etc cet tea aea aaa 
Ala Tyr Asn Asn Phe Leu Asn Tyr Val Ser Ser Leu Pro Ser Thr Lys 

790 795 800 

gea ggg ecc aaa att gtg tet eat ttg etc eat tta gtg gat aac aaa 
20 Ala Gly Pro Lys Me Val Ser His Leu Leu His Leu Val Asp Asn Lys 
805 810 815 

gag tea ttg gag aat ata tet gaa aat gat gae tac tta aag eac cag 2793 
Glu Ser Leu Glu Asn Me Ser Glu Asn Asp Asp Tyr Leu Lys His Gin 
820 825 830 

25 eca gaa att tea ctg cag atg cag tta ett agg gga ttg tgg eaa att 2841 
Pro Glu Me Ser Leu Gin Met Gin Leu Leu Arg Gly Leu Trp Gin Me 
835 840 845 850 

tgt eca eaa get tac ttt tea atg gtt tea gaa cat tta ctg gtt ett 
Cys Pro Gin Ala Tyr Phe Ser Met Val Ser Glu His Leu Leu Val Leu 



2697 



2745 



2889 



20 



2985 



3033 



855 860 865 

gcc ctg aaa act get tat caa age aac act gtt get gcg tgt tet cca 2937 
Ala Leu Lys Thr Ala Tyr Gin Ser Asn Thr Val Ala Ala Cys Ser Pro 
870 875 880 

5 ttt gtt ttg caa tte ett caa ggg aga aca ctg act ttg ggt gcg ctt 
Phe Val Leu Gin Phe Leu Gin Gly Arg Thr Leu Thr Leu Gly Ala Leu 

885 890 895 

aac tta cag tac ttt ttc gac cac cca gaa age ttg tea ttg ttg agg 
Asn Leu Gin Tyr Phe Phe Asp His Pro Glu Ser Leu Ser Leu Leu Arg 
10 900 905 910 

age ate cac ttc cca ata ega gga aat aag aca tea ccc aga gea eat 3081 
Ser Me His Phe Pro Me Arg Gly Asn Lys Thr Ser Pro Arg Ala His 
915 920 925 930 

ttt tea gtt etg gaa aea tgt ttt gae aaa tea eag gtg eea act ata 3129 
15 Phe Ser Val Leu Glu Thr Cys Phe Asp Lys Ser Gin Val Pro Thr Me 

935 940 945 

gat cag gac tat get tet gcc ttt gaa cet atg aat gaa tgg gag ega 3177 
Asp Gin Asp Tyr Ala Ser Ala Phe Glu Pro Met Asn Glu Trp Glu Arg 
950 955 960 

20 aat tta get gaa aaa gag gat aat gta aag age tat atg gat atg cag 3225 
Asn Leu Ala Glu Lys Glu Asp Asn Val Lys Ser Tyr Met Asp Met Gin 

965 970 975 

cgc agg gea tea cca gac ctt agt act ggc tat tgg aaa ctt tet cca 3273 
Arg Arg Ala Ser Pro Asp Leu Ser Thr Gly Tyr Trp Lys Leu Ser Pro 
25 980 985 990 

aag eag tac aag att ccc tgt eta gaa gtc gat gtg aat gat att gat 3321 
Lys Gin Tyr Lys Me Pro Cys Leu Glu Val Asp Val Asn Asp Me Asp 
995 1000 1005 1010 

gtt gta ggc cag gat atg ctt gag att eta atg aca gtt ttc tea get 



3369 



21 



Val Val Gly Gin Asp Met Leu Glu Me Leu Met Thr Val Phe Ser Ala 

1015 1020 1025 

tea cag cgc ate gaa etc cat tta aac cac age aga ggc ttt ata gaa 3417 
Ser Gin Arg Me Glu Leu His Leu Asn His Ser Arg Gly Phe Me Glu 
5 1030 1035 1040 

ago ate cgc cca get ett gag ctg tct aag gee tct gtc ace aag tgc 3465 
Ser Me Arg Pro Ala Leu Glu Leu Ser Lys Ala Ser Val Thr Lys Cys 

1045 1050 1055 

tee ata age aag ttg gaa etc age gca gee gaa cag gaa ctg ett etc 3513 
10 Ser I le Ser Lys Leu Glu Leu Ser Ala Ala Glu Gin Glu Leu Leu Leu 
1060 1065 1070 

ace ctg cet tec ctg gaa tct ett gaa gtc tea ggg aca ate cag tea 3561 
Thr Leu Pro Ser Leu Glu Ser Leu Glu Val Ser Gly Thr Me Gin Ser 
1075 1080 1085 1090 

15 caa gae caa ate ttt ect aat ctg gat aag ttc ctg tgc ctg aaa gaa 
Gin Asp Gin Me Phe Pro Asn Leu Asp Lys Phe Leu Cys Leu Lys Glu 

1095 1100 1105 

ctg tct gtg gat ctg gag ggc aat ata aat gtt ttt tea gtc att cet 
Leu Ser Val Asp Leu Glu Gly Asn I le Asn Val Phe Ser Val I le Pro 
20 1110 1115 1120 

gaa gaa ttt cca aac ttc cac cat atg gag aaa tta ttg ate caa att 
Glu Glu Phe Pro Asn Phe His His Met Glu Lys Leu Leu Me Gin Me 

1125 1130 1135 

tea get gag tat gat ect tec aaa eta gta aaa tta att caa aat tct 3753 
25 Ser Ala Glu Tyr Asp Pro Ser Lys Leu Val Lys Leu Me Gin Asn Ser 
1140 1145 1150 

cca aac ett cat gtt ttc cat ctg aag tgt aac ttc ttt teg gat ttt 3801 
Pro Asn Leu His Val Phe His Leu Lys Cys Asn Phe Phe Ser Asp Phe 
1155 1160 1165 1170 



3609 



3657 



3705 
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ggg tct etc atg act atg ctt gtt tec tgt aag aaa etc aca gaa att 3849 
Gly Ser Leu Met Thr Met Leu Val Ser Gys Lys Lys Leu Thr Glu Me 

1175 1180 1185 

aag ttt teg gat tea ttt ttt caa gee gtc cea ttt gtt gcc agt ttg 3897 
5 Lys Phe Ser Asp Ser Phe Phe Gin Ala Val Pro Phe Val Ala Ser Leu 
1190 1195 1200 

oca aat ttt att tct ctg aag ata tta aat ctt gaa ggc cag caa ttt 3945 
Pro Asn Phe lie Ser Leu Lys lie Leu Asn Leu Glu Gly Gin Gin Phe 
1205 1210 1215 

10 cct gat gag gaa aca tea gaa aaa ttt gee tae att tta ggt tct ctt 3993 
Pro Asp Glu Glu Thr Ser Glu Lys Phe Ala Tyr Me Leu Gly Ser Leu 

1220 1225 1230 

agt aac ctg gaa gaa ttg ate ctt cct act ggg gat gga att tat cga 4041 
Ser Asn Leu Glu Glu Leu I !e Leu Pro Thr Gly Asp Gly lie Tyr Arg 
15 1235 1240 1245 1250 

gtg gee aaa ctg ate ate cag cag tgt eag cag ctt eat tgt etc cga 4089 
Val Ala Lys Leu Me Me Gin Gin Gys Gin Gin Leu His Gys Leu Arg 

1255 1260 1265 

gtc etc tea ttt tte aag act ttg aat gat gae age gtg gtg gaa att 4137 
20 Val Leu Ser Phe Phe Lys Thr Leu Asn Asp Asp Ser Val Val Glu Me 
1270 1275 1280 

gcc aaa gta gca ate agt gga ggt ttc cag aaa ctt gag aac eta aag 4185 
Ala Lys Val Ala Me Ser Gly Gly Phe Gin Lys Leu Glu Asn Leu Lys 
1285 1290 1295 

25 ctt tea ate aat eac aag att aca gag gaa gga tae aga aat tte ttt 4233 
Leu Ser I le Asn His Lys I le Thr Glu Glu Gly Tyr Arg Asn Phe Phe 

1300 1305 1310 

caa gca ctg gac aac atg eca aac ttg cag gag ttg gac ate tec agg 4281 
Gin Ala Leu Asp Asn Met Pro Asn Leu Gin Glu Leu Asp I le Ser Arg 
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1315 1320 1325 1330 

cat ttc aca gag tgt ate aaa get cag gee aea aca gtc aag tot ttg 4329 
His Phe Thr Glu Cys Me Lys Ala Gin Ala Thr Thr Val Lys Ser Leu 
1335 1340 1345 

5 agt eaa tgt gtg tta ega eta cea agg etc att aga ctg aae atg tta 4377 
Ser Gin Cys Val Leu Arg Leu Pro Arg Leu I le Arg Leu Asn Met Leu 

1350 1355 1360 

agt tgg etc ttg gat gca gat gat att goa ttg ctt aat gtc atg aaa 4425 
Ser Trp Leu Leu Asp Ala Asp Asp I le Ala Leu Leu Asn Val Met Lys 
10 1365 1370 1375 

gaa aga cat cet eaa tct aag tac tta act att etc cag aaa tgg ata 4473 
Glu Arg His Pro Gin Ser Lys Tyr Leu Thr I le Leu Gin Lys Trp I le 

1380 1385 1390 

ctg ccg ttc tct cca ate att cag aaa taa aagattcagc taaaaactgc 4523 
15 Leu Pro Phe Ser Pro lie lie Gin Lys 
1395 1400 

tgaatcaata atttgtettg gggeatattg aggatgtaaa aaaagttgtt gattaatget 4583 
aaaaaecaaa ttatccaaaa ttattttatt aaatattgea tacaaaagaa aatgtgtaag 4643 
gcttgctaaa aaaeaaaaca aaaeaaaaca eagtcetgea tactcaccac caagctcaag 4703 

20 aaataaatea teaccaatae etttgaggtc ectgagtaat eeaecccage taaaggcaaa 4763 
cccttcaatc aagtttatac agcaaaccct ccattgtcca tggtcaacag ggaaggggtt 4823 
ggggacaggt ctgecaatet atctaaaage cacaatatgg aagaagtatt caatttatat 4883 
aataaatgge taaettaacg gttgaateae tttcatacat ggatgaaaeg ggtttaaeac 4943 
aggatecaca tgaatcttet gtgggecaaa atatgttcct taateettgt agaacctgte 5003 

25 ttctatattg aactagcttt ggtaeagtag agttaaetta etttccattt atccactgee 5063 
aatataaaga ggaaacaggg gttagggaaa aatgacttea ttecagagge ttcteagagt 5123 
teaacatatg ctataattta gaattttett atgaatecae tctaettggg tagaaaatat 5183 
tttatctcta gtgattgeat attatttcca tatcatagta tttcatagta ttatatttga 5243 
tatgagtgte tatateaatg tcagtgteea gaatttcgtt cetaccagtt gagtagtttt 5303 
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ctgaacggcc 


agaagaccat 


tcgaaattca 


tgatactact 


ataagttggt 


aaacaaccat 


5363 


acttttatcc 


tcatttttat 


tctcactaag aaaaaagtca 


actcccctcc 


ccttgcccaa 


5423 


gtatgaaata 


tagggacagt 


atgtatggtg tggtctcatt 


tgtttagaaa 


accacttatg 


5483 


actgggtgcg gtggctcaca 


cctgtaatcc 


cagcactttg 


ggaggctgag gcgggcgaat 


5543 


catttgaggt 


gaggagttcg 


agaccggcct 


ggccagcatg 


gtgaaacccc 


atttttgcta 


5603 


aaggtacaaa 


aattagccag 


gtgtggtggc 


acatgcctgt 


ggtcccagcc 


actggggcgg 


5663 


ctgagacgca 


ggacttgctt 


gaacccggga 


ggcagaggtt 


gcagtgagcc 


gagatggcgc 


5723 


cactgcattc 


cagcctgggc 


aacagagcaa 


gaccctgtct 


gtttcaaaac 


aaaaaacaaa 


5783 


accacttata 


ttgctagcta 


cattaagaat 


ttctgaatat 


gttactgagc 


ttgcttgtgg 


5843 


taaccattta 


taatatcaga 


aagtatatgt 


acaccaaaac 


atgttgaaca 


tccatgttgt 


5903 


acaactgaaa 


tataaataat 


tttgtcaatt 


atacctaaat 


aaaactggaa 


aaaaaaaaaa 


5963 


aaaaaaaaaa 


aaaaaaaaaa 


a 








5984 



<210> 3 
IS <211> 1295 
<212> PRT 

<213> Homo sapiens 
<400> 3 

20 Met Ala Thr Gin Gin Lys Ala Ser Asp Glu Arg I le Ser Gin Phe Asp 
15 10 15 

His Asn Leu Leu Pro Glu Leu Ser Ala Leu Leu Gly Leu Asp Ala Val 

20 25 30 

Gin Leu Ala Lys Glu Leu Glu Glu Glu Glu Gin Lys Glu Arg Ala Lys 
25 35 40 45 

Met Gin Lys Gly Tyr Asn Ser Gin Met Arg Ser Glu Ala Lys Arg Leu 

50 55 60 

Lys Thr Phe Val Thr Tyr Glu Pro Tyr Ser Ser Trp I le Pro Gin Glu 
65 70 75 80 

25 



Met Ala Ala Ala Gly Phe Tyr Phe Thr Gly Val Lys Ser Gly Me Gin 

85 90 95 

Cys Phe Cys Cys Ser Leu I le Leu Phe Gly Ala Gly Leu Thr Arg Leu 
100 105 110 

5 Pro Me Glu Asp His Lys Arg Phe His Pro Asp Cys Gly Phe Leu Leu 
115 120 125 

Asn Lys Asp Val Gly Asn Me Ala Lys Tyr Asp Me Arg Val Lys Asn 

130 135 140 

Leu Lys Ser Arg Leu Arg Gly Gly Lys Met Arg Tyr Gin Glu Glu Glu 
10 145 150 155 160 

Ala Arg Leu Ala Ser Phe Arg Asn Trp Pro Phe Tyr Val Gin Gly I le 

165 170 175 

Ser Pro Cys Val Leu Ser Glu Ala Gly Phe Val Phe Thr Gly Lys Gin 
180 185 190 

15 Asp Thr Val Gin Cys Phe Ser Cys Gly Gly Cys Leu Gly Asn Trp Glu 
1 95 200 205 

Glu Gly Asp Asp Pro Trp Lys Glu His Ala Lys Trp Phe Pro Lys Cys 

210 215 220 

Glu Phe Leu Arg Ser Lys Lys Ser Ser Glu Glu I le Thr Gin Tyr I le 
20 225 230 235 240 

Gin Ser Tyr Lys Gly Phe Val Asp Me Thr Gly Glu His Phe Val Asn 

245 250 255 

Ser Trp Val Gin Arg Glu Leu Pro Met Ala Ser Ala Tyr Cys Asn Asp 
260 265 270 

25 Ser I le Phe Ala Tyr Glu Glu Leu Arg Leu Asp Ser Phe Lys Asp Trp 
275 280 285 

Pro Arg Glu Ser Ala Val Gly Val Ala Ala Leu Ala Lys Ala Gly Leu 

290 295 300 

Phe Tyr Thr Gly Me Lys Asp Me Val Gin Cys Phe Ser Cys Gly Gly 
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305 310 315 320 

Oys Leu Glu Lys Trp Gin Glu Gly Asp Asp Pro Leu Asp Asp His Thr 

325 330 335 

Arg Cys Phe Pro Asn Cys Pro Phe Leu Gin Asn Met Lys Ser Ser Ala 
5 340 345 350 

Glu Val Thr Pro Asp Leu Gin Ser Arg Gly Glu Leu Cys Glu Leu Leu 

355 360 365 

Glu Thr Thr Ser Glu Ser Asn Leu Glu Asp Ser Me Ala Val Gly Pro 
370 375 380 

10 Me Val Pro Glu Met Ala Gin Gly Glu Ala Gin Trp Phe Gin Glu Ala 
385 390 395 400 

Lys Asn Leu Asn Glu Gin Leu Arg Ala Ala Tyr Thr Ser Ala Ser Phe 

405 410 415 

Arg His Met Ser Leu Leu Asp I le Ser Ser Asp Leu Ala Thr Asp His 
15 420 425 430 

Leu Leu Gly Cys Asp Leu Ser Me Ala Ser Lys His Me Ser Lys Pro 

435 440 445 

Val Gin Glu Pro Leu Val Leu Pro Glu Val Phe Gly Asn Leu Asn Ser 
450 455 460 

20 Val Met Cys Val Glu Gly Glu Ala Gly Ser Gly Lys Thr Val Leu Leu 
465 470 475 480 

Lys Lys Me Ala Phe Leu Trp Ala Ser Gly Cys Cys Pro Leu Leu Asn 

485 490 495 

Arg Phe Gin Leu Val Phe Tyr Leu Ser Leu Ser Ser Thr Arg Pro Asp 
25 500 505 510 

Glu Gly Leu Ala Ser Me Me Cys Asp Gin Leu Leu Glu Lys Glu Gly 

515 520 525 

Ser Val Thr Glu Met Cys Met Arg Asn Me Me Gin Gin Leu Lys Asn 
530 535 540 
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Gin Val Leu Phe Leu Leu Asp Asp Tyr Lys Glu He Cys Ser I le Pro 
545 550 555 560 

Gin Val I le Gly Lys Leu Me Gin Lys Asn His Leu Ser Arg Thr Cys 
565 570 575 

5 Leu Leu Me Ala Val Arg Thr Asn Arg Ala Arg Asp I le Arg Arg Tyr 
580 585 590 

Leu Glu Thr I le Leu Glu I le Lys Ala Phe Pro Phe Tyr Asn Thr Val 

595 600 605 

Cys I le Leu Arg Lys Leu Phe Ser His Asn Met Thr Arg Leu Arg Lys 
10 610 615 620 

Phe Met Val Tyr Phe Gly Lys Asn Gin Ser Leu Gin Lys I le Gin Lys 
625 630 635 640 

Thr Pro Leu Phe Val Ala Ala lie Cys Ala His Trp Phe Gin Tyr Pro 
645 650 655 

15 Phe Asp Pro Ser Phe Asp Asp Val Ala Val Phe Lys Ser Tyr Met Glu 
660 665 670 

Arg Leu Ser Leu Arg Asn Lys Ala Thr Ala Glu Me Leu Lys Ala Thr 

675 680 685 

Val Ser Ser Cys Gly Glu Leu Ala Leu Lys Gly Phe Phe Ser Cys Cys 
20 690 695 700 

Phe Glu Phe Asn Asp Asp Asp Leu Ala Glu Ala Gly Val Asp Glu Asp 
705 710 715 720 

Glu Asp Leu Thr Met Cys Leu Met Ser Lys Phe Thr Ala Gin Arg Leu 
725 730 735 

25 Arg Pro Phe Tyr Arg Phe Leu Ser Pro Ala Phe Gin Glu Phe Leu Ala 
740 745 750 

Gly Met Arg Leu Me Glu Leu Leu Asp Ser Asp Arg Gin Glu His Gin 

755 760 765 

Asp Leu Gly Leu Tyr His Leu Lys Gin Me Asn Ser Pro Met Met Thr 
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770 775 780 

Val Ser Ala Tyr Asn Asn Phe Leu Asn Tyr Val Ser Ser Leu Pro Ser 
785 790 795 800 

Thr Lys Ala Gly Pro Lys lie Val Ser His Leu Leu His Leu Val Asp 
5 805 810 815 

Asn Lys Glu Ser Leu Glu Asn Me Ser Glu Asn Asp Asp Tyr Leu Lys 

820 825 830 

His Gin Pro Glu I le Ser Leu Gin Met Gin Leu Leu Arg Gly Leu Trp 
835 840 845 

10 Gin Me Cys Pro Gin Ala Tyr Phe Ser Met Val Ser Glu His Leu Leu 
850 855 860 

Val Leu Ala Leu Lys Thr Ala Tyr Gin Ser Asn Thr Val Ala Ala Cys 
865 870 875 880 

Ser Pro Phe Val Leu Gin Phe Leu Gin Gly Arg Thr Leu Thr Leu Gly 
15 885 890 895 

Ala Leu Asn Leu Gin Tyr Phe Phe Asp His Pro Glu Ser Leu Ser Leu 

900 905 910 

Leu Arg Ser Me His Phe Pro I le Arg Gly Asn Lys Thr Ser Pro Arg 
915 920 925 

20 Ala His Phe Ser Val Leu Glu Thr Cys Phe Asp Lys Ser Gin Val Pro 
930 935 940 

Thr Me Asp Gin Asp Tyr Ala Ser Ala Phe Glu Pro Met Asn Glu Trp 
945 950 955 960 

Glu Arg Asn Leu Ala Glu Lys Glu Asp Asn Val Lys Ser Tyr Met Asp 
25 965 970 975 

Met Gin Arg Arg Ala Ser Pro Asp Leu Ser Thr Gly Tyr Trp Lys Leu 

980 985 990 

Ser Pro Lys Gin Tyr Lys Me Pro Cys Leu Glu Val Asp Val Asn Asp 
995 1000 1005 
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I le Asp Val Val Gly Gin Asp Met Leu Glu I le Leu Met Thr Val Phe 

1010 1015 1020 

Ser Ala Ser Gin Arg I le Glu Leu His Leu Asn His Ser Arg Gly Phe 
025 1030 1035 1040 

5 I le Glu Ser I le Arg Pro Ala Leu Glu Leu Ser Lys Ala Ser Val Thr 

1045 1050 1055 

Lys Cys Ser I le Ser Lys Leu Glu Leu Ser Ala Ala Glu Gin Glu Leu 

1060 1065 1070 

Leu Leu Thr Leu Pro Ser Leu Glu Ser Leu Glu Val Ser Gly Thr Me 
10 1075 1080 1085 

Gin Ser Gin Asp Gin Me Phe Pro Asn Leu Asp Lys Phe Leu Cys Leu 

1090 1095 1100 

Lys Glu Leu Ser Val Asp Leu Glu Gly Asn Me Asn Val Phe Ser Val 
105 1110 1115 1120 

15 Me Pro Glu Glu Phe Pro Asn Phe His His Met Glu Lys Leu Leu I le 

1125 1130 1135 

Gin Me Ser Ala Glu Tyr Asp Pro Ser Lys Leu Val Lys Leu Me Gin 

1140 1145 1150 

Asn Ser Pro Asn Leu His Val Phe His Leu Lys Cys Asn Phe Phe Ser 
20 1155 1160 1165 

Asp Phe Gly Ser Leu Met Thr Met Leu Val Ser Cys Lys Lys Leu Thr 

1170 1175 1180 

Glu Me Lys Phe Ser Asp Ser Phe Phe Gin Ala Val Pro Phe Val Ala 
185 1190 1195 1200 

25 Ser Leu Pro Asn Phe Me Ser Leu Lys I le Leu Asn Leu Glu Gly Gin 

1205 1210 1215 

Gin Phe Pro Asp Glu Glu Thr Ser Glu Lys Phe Ala Tyr Me Leu Gly 

1220 1225 1230 

Ser Leu Ser Asn Leu Glu Glu Leu I le Leu Pro Thr Gly Asp Gly Me 

30 



1235 1240 1245 

Tyr Arg Val Ala Lys Leu Me Me Gin Gin Cys Gin Gin Leu His Gys 

1250 1255 1260 

Leu Arg Val Leu Ser Phe Phe Lys Thr Leu Asn Asp Asp Ser Val Val 
265 1270 1275 1280 

Glu I le Gly Glu Leu Val Phe Gin Leu Ala Trp Lys Pro Val Val 
1285 1290 1295 

<210> 4 

<211> 5366 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (292).. (4176) 
<400> 4 

acaaaaggtc ctgtgctcac ctgggaccct tctggacgtt gccctgtgtt cctcttcgcc 60 
tgcctgttca tctacgacga accccgggta ttgaccccag acaacaatgc cacttcatat 120 
tggggacttc gtctgggatt ccaaggtgca ttcattgcaa agttccttaa atattttctc 180 
actgcttcct actaaaggac ggacagagca tttgttcttc agccacatac tttccttcca 240 
ctggccagca ttctcctcta ttagactaga actgtggata aacctcagaa a atg gcc 297 

Met Ala 
1 

acc cag cag aaa gcc tct gac gag agg ate tec cag ttt gat cae aat 345 
Thr Gin Gin Lys Ala Ser Asp Glu Arg I le Ser Gin Phe Asp His Asn 

5 10 15 

ttg ctg cca gag ctg tct get ctt ctg ggc eta gat gea gtt cag ttg 393 



Leu Leu Pro Glu Leu Ser Ala Leu Leu Gly Leu Asp Ala Val Gin Leu 

20 25 30 

gca aag gaa eta gaa gaa gag gag cag aag gag cga gca aaa atg cag 441 
Ala Lys Glu Leu Glu Glu Glu Glu Gin Lys Glu Arg Ala Lys Met Gin 
5 35 40 45 50 

aaa ggc tac aac tct caa atg cgc agt gaa gca aaa agg tta aag act 489 
Lys Gly Tyr Asn Ser Gin Met Arg Ser Glu Ala Lys Arg Leu Lys Thr 

55 60 65 

ttt gtg act tat gag ccg tac age tea tgg ata cca cag gag atg gcg 537 
10 Phe Val Thr Tyr Glu Pro Tyr Ser Ser Trp Me Pro Gin Glu Met Ala 
70 75 80 

gee get ggg ttt tac ttc act ggg gta aaa tct ggg att cag tgc tte 585 
Ala Ala Gly Phe Tyr Phe Thr Gly Val Lys Ser Gly Me Gin Cys Phe 
85 90 95 

15 tgc tgt age eta ate etc ttt ggt gee ggc etc acg aga etc cec ata 633 
Cys Cys Ser Leu I le Leu Phe Gly Ala Gly Leu Thr Arg Leu Pro I le 

100 105 110 

gaa gac cac aag agg ttt eat cca gat tgt ggg ttc ett ttg aac aag 681 
Glu Asp His Lys Arg Phe His Pro Asp Cys Gly Phe Leu Leu Asn Lys 
20 115 120 125 130 

gat gtt ggt aac att gee aag tac gac ata agg gtg aag aat ctg aag 729 
Asp Val Gly Asn I le Ala Lys Tyr Asp I le Arg Val Lys Asn Leu Lys 

135 140 145 

age agg ctg aga gga ggt aaa atg agg tac caa gaa gag gag get aga 777 
25 Ser Arg Leu Arg Gly Gly Lys Met Arg Tyr Gin Glu Glu Glu Ala Arg 
150 155 160 

ett gca tec tte agg aac tgg cca ttt tat gtc caa ggg ata tee cet 825 
Leu Ala Ser Phe Arg Asn Trp Pro Phe Tyr Val Gin Gly Me Ser Pro 
165 170 175 
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tgt gtg etc tea gag get gge ttt gtc ttt aea ggt aaa eag gac aeg 873 
Cys Va! Leu Ser Glu Ala Gly Phe Val Phe Thr Gly Lys Gin Asp Thr 

180 185 190 

gta cag tgt ttt tec tgt ggt gga tgt tta gga aat tgg gaa gaa gga 921 
5 Val Gin Cys Phe Ser Cys Gly Gly Cys Leu Gly Asn Trp Glu Glu Gly 
195 200 205 210 

gat gat eet tgg aag gaa eat gee aaa tgg tte ecc aaa tgt gaa ttt 969 
Asp Asp Pro Trp Lys Glu His Ala Lys Trp Phe Pro Lys Cys Glu Phe 
215 220 225 

10 ctt egg agt aag aaa tec tea gag gaa att acc cag tat att caa age 1017 
Leu Arg Ser Lys Lys Ser Ser Glu Glu Me Thr Gin Tyr Me Gin Ser 

230 235 240 

tae aag gga ttt gtt gae ata aeg gga gaa eat ttt gtg aat tee tgg 1065 
Tyr Lys Gly Phe Val Asp lie Thr Gly Glu His Phe Val Asn Ser Trp 
15 245 250 255 

gtc cag aga gaa tta cct atg gea tea get tat tgc aat gac age ate 1113 
Val Gin Arg Glu Leu Pro Met Ala Ser Ala Tyr Cys Asn Asp Ser Me 

260 265 270 

ttt get tae gaa gaa eta egg ctg gae tet ttt aag gac tgg ecc egg 1161 
20 Phe Ala Tyr Glu Glu Leu Arg Leu Asp Ser Phe Lys Asp Trp Pro Arg 
275 280 285 290 

gaa tea get gtg gga gtt gea gca ctg gee aaa gea ggt ctt tte tac 1209 
Glu Ser Ala Val Gly Val Ata Ala Leu Ala Lys Ala Gly Leu Phe Tyr 
295 300 305 

25 aea ggt ata aag gac ate gtc cag tgc ttt tec tgt gga ggg tgt tta 1257 
Thr Gly Me Lys Asp Me Val Gin Cys Phe Ser Cys Gly Gly Cys Leu 

310 315 320 

gag aaa tgg cag gaa ggt gat gac cca tta gac gat eac acc aga tgt 1305 
Glu Lys Trp Gin Glu Gly Asp Asp Pro Leu Asp Asp His Thr Arg Cys 



33 



325 330 335 

ttt ccc aat tgt cca ttt etc caa aat atg aag tec tet geg gaa gtg 1353 

Phe Pro Asn Cys Pro Phe Leu Gin Asn Met Lys Ser Ser Ala Glu Val 
340 345 350 

5 act cca gac ctt cag age cgt ggt gaa ett tgt gaa tta ctg gaa acc 1401 

Thr Pro Asp Leu Gin Ser Arg Gly Glu Leu Cys Glu Leu Leu Glu Thr 

355 360 365 370 

aca agt gaa age aat ctt gaa gat tea at a gca gtt ggt cct ata gtg 1449 

Thr Ser Glu Ser Asn Leu Glu Asp Ser Me Ala Val Gly Pro Me Val 
10 375 380 385 

cca gaa atg gca cag ggt gaa gee cag tgg ttt caa gag gca aag aat 1497 

Pro Glu Met Ala Gin Gly Glu Ala Gin Trp Phe Gin Glu Ala Lys Asn 

390 395 400 

ctg aat gag eag ctg aga gca get tat acc age gee agt tte ege eac 1545 

15 Leu Asn Glu Gin Leu Arg Ala Ala Tyr Thr Ser Ala Ser Phe Arg His 
405 410 415 

atg tet ttg ctt gat ate tct tec gat ctg gee aeg gac eac ttg ctg 1593 

Met Ser Leu Leu Asp I le Ser Ser Asp Leu Ala Thr Asp His Leu Leu 
420 425 430 

20 gge tgt gat ctg tet att get tea aaa eac ate age aaa cct gtg caa 1641 

Gly Cys Asp Leu Ser I le Ala Ser Lys His I le Ser Lys Pro Val Gin 

435 440 445 450 

gaa cct ctg gtg ctg cct gag gtc ttt gge aac ttg aac tct gtc atg 1689 

Glu Pro Leu Val Leu Pro Glu Val Phe Gly Asn Leu Asn Ser Val Met 
25 455 460 465 

tgt gtg gag ggt gaa get gga agt gga aag aeg gtc etc ctg aag aaa 1737 

Cys Val Glu Gly Glu Ala Gly Ser Gly Lys Thr Val Leu Leu Lys Lys 

470 475 480 

ata get ttt ctg tgg gca tct gga tgc tgt ccc ctg tta aac agg tte 1785 
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Me Ala Phe Leu Trp Ala Ser Gly Cys Cys Pro Leu Leu Asn Arg Phe 

485 490 495 

cag ctg gtt ttc tac etc tec ctt agt tee ace aga eea gae gag ggg 1833 
Gin Leu Val Phe Tyr Leu Ser Leu Ser Ser Thr Arg Pro Asp Glu Gly 
5 500 505 510 

ctg gcc agt ate ate tgt gae cag etc eta gag aaa gaa gga tct gtt 1881 
Leu Ala Ser lie Me Cys Asp Gin Leu Leu Glu Lys Glu Gly Ser Val 
515 520 525 530 

act gaa atg tge atg agg aac att ate cag cag tta aag aat cag gtc 1929 
10 Thr Glu Met Cys Met Arg Asn Me Me Gin Gin Leu Lys Asn Gin Val 

535 540 545 

tta ttc ctt tta gat gae tac aaa gaa at a tgt tea ate cet caa gtc 1977 
Leu Phe Leu Leu Asp Asp Tyr Lys Glu Me Cys Ser I le Pro Gin Val 
550 555 560 

15 ata gga aaa ctg att caa aaa aac cac tta tec egg ace tge eta ttg 2025 
Me Gly Lys Leu Me Gin Lys Asn His Leu Ser Arg Thr Cys Leu Leu 

565 570 575 

att get gtc cgt aca aac agg gcc agg gae ate cge ega tac eta gag 2073 
Me Ala Val Arg Thr Asn Arg Ala Arg Asp Me Arg Arg Tyr Leu Glu 
20 580 585 590 

ace att eta gag ate aaa gea ttt cec ttt tat aat act gtc tgt ata 2121 
Thr I le Leu Glu I le Lys Ala Phe Pro Phe Tyr Asn Thr Val Cys I le 
595 600 605 610 

tta egg aag etc ttt tea cat aat atg act cgt ctg ega aag ttt atg 2169 
25 Leu Arg Lys Leu Phe Ser His Asn Met Thr Arg Leu Arg Lys Phe Met 

615 620 625 

gtt tac ttt gga aag aac caa agt ttg cag aag ata cag aaa act cet 2217 
Val Tyr Phe Gly Lys Asn Gin Ser Leu Gin Lys Me Gin Lys Thr Pro 
630 635 640 
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etc ttt gtg gcg gcg ate tgt get cat tgg ttt cag tat cet ttt gac 2265 
Leu Phe Val Ala Ala Me Cys Ala His Trp Phe Gin Tyr Pro Phe Asp 

645 650 655 

oca tec ttt gat gat gtg get gtt ttc aag tee tat atg gaa ege ett 2313 
5 Pro Ser Phe Asp Asp Val Ala Val Phe Lys Ser Tyr Met Glu Arg Leu 
660 665 670 

tec tta agg aac aaa gcg aca get gaa att etc aaa gca act gtg tec 2361 
Ser Leu Arg Asn Lys Ala Thr Ala Glu Me Leu Lys Ala Thr Val Ser 
675 680 685 690 

10 tee tgt ggt gag ctg gee ttg aaa ggg ttt ttt tea tgt tgc ttt gag 2409 
Ser Cys Gly Glu Leu Ala Leu Lys Gly Phe Phe Ser Cys Cys Phe Glu 

695 700 705 

ttt aat gat gat gat etc gca gaa gca ggg gtt gat gaa gat gaa gat 2457 
Phe Asn Asp Asp Asp Leu Ala Glu Ala Gly Val Asp Glu Asp Glu Asp 
15 710 715 720 

eta ace atg tge ttg atg age aaa ttt aca gee eag aga eta aga eea 2505 
Leu Thr Met Cys Leu Met Ser Lys Phe Thr Ala Gin Arg Leu Arg Pro 

725 730 735 

tte tac egg ttt tta agt ect gee ttc caa gaa ttt ett gcg ggg atg 2553 
20 Phe Tyr Arg Phe Leu Ser Pro Ala Phe Gin Glu Phe Leu Ala Gly Met 
740 745 750 

agg ctg att gaa etc etg gat tea gat agg eag gaa cat caa gat ttg 2601 
Arg Leu Me Glu Leu Leu Asp Ser Asp Arg Gin Glu His Gin Asp Leu 
755 760 765 770 

25 gga etg tat eat ttg aaa caa ate aac tea ecc atg atg act gta age 2649 
Gly Leu Tyr His Leu Lys Gin Me Asn Ser Pro Met Met Thr Val Ser 

775 780 785 

gee tac aac aat ttt ttg aac tat gte tec age etc ect tea aca aaa 2697 
Ala Tyr Asn Asn Phe Leu Asn Tyr Val Ser Ser Leu Pro Ser Thr Lys 
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790 795 800 

gca ggg ccc aaa att gtg tct cat ttg etc cat tta gtg gat aac aaa 2745 
Ala Gly Pro Lys Me Val Ser His Leu Leu His Leu Val Asp Asn Lys 
805 810 815 

5 gag tea ttg gag aat ata tct gaa aat gat gac tae tta aag cac cag 2793 
Glu Ser Leu Glu Asn I le Ser Glu Asn Asp Asp Tyr Leu Lys His Gin 

820 825 830 

cca gaa att tea ctg cag atg cag tta ctt agg gga ttg tgg caa att 2841 
Pro Glu I le Ser Leu Gin Met Gin Leu Leu Arg Gly Leu Trp Gin I le 
10 835 840 845 850 

tgt cca caa get tae ttt tea atg gtt tea gaa cat tta ctg gtt ctt 2889 
Cys Pro Gin Ala Tyr Phe Ser Met Val Ser Glu His Leu Leu Val Leu 

855 860 865 

gcc ctg aaa act get tat caa age aac act gtt get gcg tgt tct cca 2937 
15 Ala Leu Lys Thr Ala Tyr Gin Ser Asn Thr Val Ala Ala Cys Ser Pro 
870 875 880 

ttt gtt ttg caa ttc ctt caa ggg aga aca ctg act ttg ggt gcg ctt 2985 
Phe Val Leu Gin Phe Leu Gin Gly Arg Thr Leu Thr Leu Gly Ala Leu 
885 890 895 

20 aac tta cag tac ttt ttc gac cac cca gaa age ttg tea ttg ttg agg 3033 
Asn Leu Gin Tyr Phe Phe Asp His Pro Glu Ser Leu Ser Leu Leu Arg 

900 905 910 

age ate cac ttc cca ata ega gga aat aag aca tea ccc aga gea eat 3081 
Ser I le His Phe Pro I le Arg Gly Asn Lys Thr Ser Pro Arg Ala His 
25 915 920 925 930 

ttt tea gtt ctg gaa aca tgt ttt gac aaa tea cag gtg cca act ata 3129 
Phe Ser Val Leu Glu Thr Cys Phe Asp Lys Ser Gin Val Pro Thr Me 

935 940 945 

gat cag gac tat get tct gcc ttt gaa ect atg aat gaa tgg gag ega 3177 
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Asp Gin Asp Tyr Ala Ser Ala Phe Glu Pro Met Asn Glu Trp Glu Arg 

950 955 960 

aat tta get gaa aaa gag gat aat gta aag age tat atg gat atg eag 3225 
Asn Leu Ala Glu Lys Glu Asp Asn Val Lys Ser Tyr Met Asp Met Gin 
5 965 970 975 

cgc agg gca tea cca gac ett agt act ggc tat tgg aaa ett tet eca 3273 
Arg Arg Ala Ser Pro Asp Leu Ser Thr Gly Tyr Trp Lys Leu Ser Pro 

980 985 990 

aag eag tae aag att ecc tgt eta gaa gte gat gtg aat gat att gat 3321 
10 Lys Gin Tyr Lys Me Pro Cys Leu Glu Val Asp Val Asn Asp Me Asp 
995 1000 1005 1010 

gtt gta ggc cag gat atg ett gag att eta atg aea gtt tte tea get 3369 
Val Val Gly Gin Asp Met Leu Glu I le Leu Met Thr Val Phe Ser Ala 
1015 1020 1025 

15 tea eag ege ate gaa ete eat tta aae eae age aga gge ttt ata gaa 3417 
Ser Gin Arg I le Glu Leu His Leu Asn His Ser Arg Gly Phe I le Glu 

1030 1035 1040 

age ate ege eea get ett gag etg tet aag gee tet gte aee aag tge 3465 
Ser Me Arg Pro Ala Leu Glu Leu Ser Lys Ala Ser Val Thr Lys Cys 
20 1045 1050 1055 

tee ata age aag ttg gaa ete age gea gee gaa eag gaa etg ett ete 3513 
Ser Me Ser Lys Leu Glu Leu Ser Ala Ala Glu Gin Glu Leu Leu Leu 

1060 1065 1070 

aee etg eet tee etg gaa tet ett gaa gte tea ggg aea ate eag tea 3561 
25 Thr Leu Pro Ser Leu Glu Ser Leu Glu Val Ser Gly Thr Me Gin Ser 
1075 1080 1085 1090 

caa gae eaa ate ttt eet aat etg gat aag ttc etg tgc etg aaa gaa 3609 
Gin Asp Gin Me Phe Pro Asn Leu Asp Lys Phe Leu Cys Leu Lys Glu 
1095 1100 1105 
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ctg tct gtg gat ctg gag ggc aat ata aat gtt ttt tea gtc att cct 3657 
Leu Ser Val Asp Leu Glu Gly Asn Me Asn Val Ph Ser Val Me Pro 

1110 1115 1120 

gaa gaa ttt cca aac ttc cac cat atg gag aaa tta ttg ate caa att 3705 
5 Glu Glu Phe Pro Asn Phe His His Met Glu Lys Leu Leu I le Gin I le 
1125 1130 1135 

tea get gag tat gat eet tec aaa eta gta aaa tta att caa aat tct 3753 
Ser Ala Glu Tyr Asp Pro Ser Lys Leu Val Lys Leu Me Gin Asn Ser 
1140 1145 1150 

10 cca aac ctt cat gtt ttc cat ctg aag tgt aac ttc ttt teg gat ttt 3801 
Pro Asn Leu His Val Phe His Leu Lys Cys Asn Phe Phe Ser Asp Phe 
1155 1160 1165 1170 

ggg tct etc atg act atg ctt gtt tec tgt aag aaa etc aca gaa att 3849 
Gly Ser Leu Met Thr Met Leu Val Ser Cys Lys Lys Leu Thr Glu Me 
15 1175 1180 1185 

aag ttt teg gat tea ttt ttt caa gee gtc cca ttt gtt gee agt ttg 3897 
Lys Phe Ser Asp Ser Phe Phe Gin Ala Val Pro Phe Val Ala Ser Leu 

1190 1195 1200 

cca aat ttt att tct ctg aag ata tta aat ctt gaa ggc cag caa ttt 3945 
20 Pro Asn Phe Me Ser Leu Lys Me Leu Asn Leu Glu Gly Gin Gin Phe 
1205 1210 1215 

cct gat gag gaa aca tea gaa aaa ttt gee tae att tta ggt tet ett 3993 
Pro Asp Glu Glu Thr Ser Glu Lys Phe Ala Tyr I le Leu Gly Ser Leu 
1220 1225 1230 

25 agt aac ctg gaa gaa ttg ate ctt cct act ggg gat gga att tat ega 4041 
Ser Asn Leu Glu Glu Leu I le Leu Pro Thr Gly Asp Gly I le Tyr Arg 
1235 1240 1245 1250 

gtg gee aaa etg ate ate eag eag tgt eag eag ctt eat tgt etc ega 4089 
Val Ala Lys Leu Me Me Gin Gin Cys Gin Gin Leu His Cys Leu Arg 
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1255 1260 1265 

gtc etc tea ttt tte aag aet ttg aat gat gae age gtg gtg gaa att 4137 
Val Leu Ser Phe Phe Lys Thr Leu Asn Asp Asp Ser Val Val 6lu I le 
1270 1275 1280 



5 ggt gag eta gtg ttt eag ett gea tgg aag eea gtg gta tag 4179 
Gly Glu Leu Val Phe Gin Leu Ala Trp Lys Pro Val Val 





1285 


i 1290 


1295 








eeaagettte 


tgetgeaaea tgtetatgta aaeatttgee 


eetctagaaa 


tttteaaeee 


4239 




gctteeteat 


ttteaetate ataetgttee ttctagtgte 


ettetgtgga 


tttaggegea 


4299 


10 


ttetggteag 


atttggaagt aeaaaaaggt cteeeatttg 


tggatataea 


agceeteaaa 


4359 




tetgegttet 


tgecaeetgg tgttttagae aeetggeeae 


ataeteteet 


aagtaeteet 4419 




ttttaaaaet 


gaagatgaat ataeaeacag aaaagtaeaa 


aaateatgtg 


tactgeteae 


4479 




tgaattttat 


tttcttattt tcttcttttt tttttttttg 


agacagagtt 


tcgctcgtgt 


4539 




tgeeeagget 


ggagtaeaat ggeacgatet egggteactg 


eaaaetetgc 


etcetgggtt 


4599 


15 


eaagegatte 


teetgeetea geeteeeaag tagetaggat 


tacaggtgaa 


egeeaeeaea 


4659 




eetggetaat 


tttgtatttt tagtaaaeae agggttteae 


eatgttggee 


aggetagtet 


4719 




egaaetcetg 


aeeteaagtg ageeaeagtg eetggeetga 


ggaaetgaga 


tttetgtega 


4779 




gaeetgaagg 


gagaatggee eaggeatagt tggtagagga 


ggaattgaga 


eateatttea 


4839 




aacagaggta 


atcacttgtg tcatagcctg gagttaaaga 


gaaccagata 


tatttgaaga 


4899 


20 


aettggggga 


aaaaaaggaa tgtctggagc aagaggcagg 


agtgagttgt 


gagaagaaga 


4959 




etggagagga 


aagtaaaagc eeaattggag aggetttgte 


gggtgtgtta 


eaagggetgg 


5019 




ateteatttt 


ettaetgete ageaetgtta ttttaegtta 


tttaaaaeag 


etgggagegg 


5079 




tggeteaage 


ttgtaateee ageactttgg gaggeegagg 


eggatggate 


aegaggteag 5139 




gagategaga 


eeateetgge taaeatggtg aaaeeeegte 


tetaetaaaa 


ataeaaaaaa 


5199 


25 


ttagccaggc 


gtgatggcgg gcacctgtag tcccagetac 


tegggagget 


gaggeaggag 


5259 




aatggtgtga 


acecgggagg tggagcttga agtgagccaa 


gatcatgcca 


ctgcaeteca 


5319 




geetgggeaa 


eagaaegaga eteegtctca aaaaaaaaaa 


aeaaaaa 




5366 



<210> 5 
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<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence: Synthesized oligonucleotide 

<400> 5 

gaggaattcc tacattttag gttctcttag t 

<210> 6 
<211> 28 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Artificial Sequence: Synthesized oligonucleotide 
<400> 6 

gaggaattca atttccacca cgctgtca 

<210> 7 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial Sequence: Synthesized oligonucleotide 



<400> 7 

aattaaccct cactaaaggg 



<210> 8 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
5 <220> 

<223> Artificial Sequence: Synthesized oligonucleotide 

<400> 8 

cattatgctg agtgtgatat cccg 24 
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